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Background and purpose — The increased use of patient- 
reported outcomes (PROs) in orthopedics requires data on esti- 
mated minimal clinically important improvements (MCIIs) and 
patient-acceptable symptom states (PASSs). We wanted to find 
cut-points corresponding to minimal cHnically important PRO 
change score and the acceptable postoperative PRO score, by 
estimating MCII and PASS 1 year after total hip arthroplasty 
(THA) for the Hip Dysfunction and Osteoarthritis Outcome Score 
(H00S)andtheEQ-5D. 

Patients and methods — THA patients from 16 different 
departments received 2 PROs and additional questions preop- 
eratively and 1 year postoperatively. The PROs included were the 
HOOS subscales pain (HOOS Pain), physical function short form 
(HOOS-PS), and hip-related quaUty of life (HOOS QoL), and the 
EQ-5D. MCn and PASS were estimated using multiple anchor- 
based approaches. 

Results — Of 1,837 patients available, 1,335 answered the 
preoperative PROs, and 1,288 of them answered the 1-year fol- 
low-up. The MCIIs and PASSs were estimated to be: 24 and 91 
(HOOS Pain), 23 and 88 (HOOS-PS), 17 and 83 (HOOS QoL), 
0.31 and 0.92 (EQ-5D Index), and 23 and 85 (EQ-VAS), respec- 
tively. MCIIs corresponded to a 38-55% improvement from mean 
baseline PRO score and PASSs corresponded to absolute follow- 
up scores of 57-91% of the maximum score in THA patients 1 
year after surgery. 

Interpretation — This study improves the interpretability of 
PRO scores. The different estimation approaches presented may 
serve as a guide for future MCII and PASS estimations in other 
contexts. The cutoff points may serve as reference values in reg- 
istry settings. 



In recent years, there has been a shift towards a more patient- 
centered perspective in orthopedic research and the use of 
patient-reported outcomes (PROs) has increased (Horan 2010, 
Wylde and Blom 2011). This has led to a discussion on how 
best to interpret PRO results (McLeod et al. 2011), because a 
statistically significant change in PRO score does not necessar- 
ily represent a clinically important improvement, and it can be 
difficult to know if a certain postoperative PRO score is accept- 
able from the patient's point of view. Since it can be problem- 
atic to interpret change in scores and absolute postoperative 
scores in a clinically meaningful way (Quintana et al. 2012), 
different cut-points can be determined. One of these cut-points 
is the minimal clinically important improvement (MCII), a 
PRO change score value defined as the minimal change repre- 
senting a clinically important improvement from the patient's 
perspective (Tubach et al. 2009). Another cut-point is the 
patient-acceptable symptom state (PASS), a value of the post- 
operative PRO score found acceptable by the patients, defined 
as the overall health state at which patients consider themselves 
to be feeling well (Maksymowych et al. 2010). Both MCII and 
PASS estimations will be of future importance in research and 
clinical practice because they focus on the patient perspective 
of total hip arthroplasty (THA). There is a lack of patient-based 
cut-points in the musculoskeletal literature (Keurentjes et al. 
2012), and the cut-points may vary depending on context even 
for a single PRO, making MCII and PASS estimations for THA 
patients warranted. 

Our aim was to find cut-points for the minimal clinically 
important improvement based on changes in PRO scores and 
the acceptable postoperative PRO score, by estimating MCII 
and PASS 1 year after THA for 2 commonly used PROs, the 
Hip Dysfunction and Osteoarthritis Outcome Score (HOOS) 
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and the EQ-5D. In addition, PASS was estimated for sub- 
groups of age, sex, and diagnosis. 



Patients and methods 

Data collection 

All patients older than 18 years receiving a THA in 16 selected 
orthopedic departments in Denmark, from March 1, 2010 to 
March 1, 201 1 were invited to participate in the study. Patients 
with acute fractures were excluded, as they are not compa- 
rable to patients undergoing THA for other causes. Preop- 
erative assessment included a disease-specific PRO (HOOS 
subscale pain (HOOS Pain), HOOS physical function short 
form (HOOS-PS), and HOOS subscale hip-related quaUty of 
life (HOOS QoL)), a generic PRO (EQ-5D), patient informa- 
tion, questionnaire instruction, additional questions regarding 
patient characteristics, and a written consent form. Only first- 
completed PROs were included if there was bilateral surgery. 

1 year postoperatively, the patients received the same PROs 
and 3 anchor questions. The postoperative questionnaires were 
mailed in paper form to the patients by regular post, includ- 
ing a return addressed envelope with a stamp. 2 reminder let- 
ters were sent if necessary (Edwards et al. 2009). All returned 
questionnaire forms were scanned electronically using a vali- 
dated automated forms-processing technique (Paulsen et al. 
2012a). 

Anchor questions 

Anchor questions were used to establish an external patient- 
reported reference in order to evaluate clinical interpretation 
of PRO change scores and postoperative PRO scores. The 
anchor questions had already been used in "Questionnaire for 
patients who have had hip surgery" from the Royal College 
of Surgeons of England (Clinical Effectiveness Unit 2009). 
Since both hip-specific and generic PROs are used to evalu- 
ate the outcome following THA, we chose to estimate MCII 
and PASS for the HOOS and the EQ-5D in comparison to the 

2 hip-specific anchor questions. However, since hip-specific 
PROs cannot be used to assess general health, MCII and PASS 
in relation to the general-health anchor question were only 
estimated for the EQ-5D. 

Anchor questions used for definition of MCII. A patient- 
reported hip-specific anchor question describing change in 
hip problems from preoperatively to 1 year postoperatively 
was used: "Overall, how are the problems now in the hip on 
which you had surgery, compared to before your operation?" 
The 5-point Likert scale response categories were: "Much 
better", "A little better", "About the same", "A little worse", 
and "Much worse". Change in general health from preoper- 
atively to postoperatively was evaluated by using the ques- 
tion: "In general, would you say your health is..." (Kamper- 
J0rgensen 2005). The 5 point Likert scale response categories 
were: "Excellent", "Very good", "Good", "Fair", and "Poor". 



Patients were asked to assess their general health status both 
preoperatively and postoperatively. For MCII estimation, 
change from preoperatively to postoperatively was calculated 
and patients answering 1 category better postoperatively than 
preoperatively were considered to be reporting a minimal 
clinically important improvement in general health (Tubach et 
al. 2005, Hays et al. 2005). For the MCII estimation, we used 
both a retrospective transition anchor and an absolute change 
anchor (Tubach et al. 2005), to circumvent the recall bias 
known to be a problem for retrospective anchors (King 201 1). 

Anclior questions used for definition of PASS. A hip-specific 
anchor question describing the patients' perception of outcome 
after surgery was used: "How would you describe the result of 
your operation?" There were 5 response options: "Excellent", 
"Very good", "Good", "Fair", and "Poor". Patients answering 
"Excellent", "Very good" or "Good" were considered to be 
reporting a hip-specific acceptable symptom state 1 year after 
THA. The general-health PASS estimation was based on the 
patient-reported postoperative general health status, using the 
same anchor question as for the general-health MCII estima- 
tion: "In general, would you say your health is..." (Kamper- 
J0rgensen 2005). If a patient answered "Excellent", "Very 
good" or "Good" for the postoperative general health anchor 
question, he or she was considered to have an acceptable gen- 
eral health state 1 year after THA. 

The PROs 

Both the HOOS and the EQ-5D are feasible to use in large- 
scale studies of THA (Paulsen et al. 2012b). 

HOOS and HOOS-PS. Hip dysfunction and Osteoarthritis 
Outcome Score (HOOS) (Nilsdotter et al. 2003) is a hip-spe- 
cific questionnaire constructed by adding questions considered 
important by patients (concerning pain, sport and recreation, 
function, hip-related quality of life, and other symptoms) to 
the WOMAC (Bellamy et al. 1988) to improve its validity for 
those with less severe disease or higher demands of physical 
function. The HOOS includes 5 subscales: Pain, Other Symp- 
toms, Function in Daily Living, Function in Sport and Recre- 
ation, and Hip-related Quality of Life. HOOS-PS is a 5-ques- 
tion short version derived using Rasch analysis (Rasch 1960) 
from the 2 HOOS subscales addressing physical function 
(ADL and Sport/Rec) by using data from samples representing 
a spectrum of osteoarthritis (OA) severity (Davis et al. 2008b). 
The HOOS-PS has been validated for use in THA (Davis et al. 
2008a). For the purpose of our study, 3 different HOOS sub- 
scales were used: HOOS Pain, HOOS-PS, and HOOS QoL. A 
score of 100 within each subscale indicates no problems and 0 
indicates extreme problems. A user guide and a scoring manual 
are available at http://www.koos.nu/index.html. 

EQ-5D. EQ-5D (The EuroQol Group 1990, Brooks 1996) is 
a well-established generic health outcome measure specially 
designed to complement other quality-of-life measures, or dis- 
ease-specific outcome measures, and it has also been used for 
economic evaluation of healthcare for several years (Dawson 
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et al. 2001, Linde 2009, Rolfson 2010). Patients describe their 
own health state in 5 dimensions: mobility, self-care, usual 
activities, pain/discomfort, and anxiety/depression on 1 out of 
3 levels of severity: no problems, some/moderate problems, 
or extreme problems. Patients are also asked to evaluate their 
current state of health on a thermometer scale from 0 ('worst 
imaginable') to 100 ('best imaginable'). EQ-5D generates 2 
overall values for quality of life, one from the patient's per- 
spective (the EQ-VAS: "Current state of health") and the other 
from a societal perspective, the EQ-5D Index (a health pro- 
file that can be transformed into a global health index with a 
weighted total value for health-related quality of life), which 
represent the patients' description of their own health and how 
this health state relates to the health state of the general popu- 
lation. A Danish tariff ranging from -0.624 (worst) to 1.000 
(best) (Wittrup-Jensen et al. 2009), based on time trade-off 
(Dolan and Roberts 2002), was used when computing the 
EQ-5D Index to adjust for culture-specific response patterns. 
A license was obtained from the EuroQol Group (http://www. 
euroqol.org/). 

Statistics 

No percentages (only numbers) are presented where n < 100. 
Any p-value less than 0.05 was considered significant. We 
used chi-square test and Wilcoxon-Mann-Whitney test to 
compare characteristics of patients who accepted or declined 
study participation. The correlation between the change 
anchor questions and the change scores for the PRO subscales 
for MCII and the correlation between the postoperative anchor 
questions and the postoperative scores for the PRO subscales 
for PASS were tested with Spearman's rank correlation coef- 
ficients. The proportions (%) of patients reporting the dif- 
ferent response categories for the anchor questions and the 
corresponding PRO change scores and postoperative PRO 
scores were calculated. PASS for subgroups of different age 
groups, sex, and diagnoses were estimated. The mean PRO 
change scores for the different subgroups were calculated 
for all patients (not only patients answering "a little better" 
for the hip-improvement anchor or all patients answering 1 
category better postoperatively than preoperatively for the 
general-health anchor), t-based confidence intervals (CIs) 
were used, as samples were of adequate size (n > 30) for 
the central limit theorem to apply and since visual inspec- 
tion revealed reasonably symmetric distributions in the few 
smaller groups (n > 15, see Supplementary data. Tables 6 and 
7). MCII values for subgroups were not estimated due to small 
subgroups. Depending on the number of subgroups, Welch's 
t-test or a so-called W test (similar to standard ANOVA F test) 
(Wilcox et al. 1986, Mitchell 2000), both allowing for unequal 
variances across groups, was used for comparison of means 
between subgroups. Absolute scores of the different HOOS 
subscales, EQ-5D Index, and EQ-VAS were calculated pre- 
operatively and postoperatively for each individual patient, as 
well as change scores from preoperatively to postoperatively. 
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In addition, mean preoperative and postoperative PRO scores 
and mean change scores for the entire study population were 
calculated. Missing data were handled in accordance with 
the manual for each EQ-5D and HOOS (Roos 2003, Cheung 
Kajang et al. 2009). 

MCII and PASS cut-points. 6 different methods were used 
to estimate MCII and PASS cut-points. First, cut-points were 
estimated by the mean change or mean score approach (de Vet 
et al. 201 1). This was regarded as the primary approach. Sec- 
ondly, cut-points were estimated by the seventy-fifth percen- 
tile approach (Tubach et al. 2005, Maksymowych et al. 2007, 
Kvien et al. 2007, Heiberg et al. 2008, Kvamme et al. 2010, 
Dougados et al. 2012). Thirdly, cut-points were estimated by 
the seventy-fifth percentile approach using tertiles (lowest-, 
middle-, and highest subscale scores) of the preoperative PRO 
scores (Tubach et al. 2006), to avoid misclassifying those with 
baseline scores only allowing for minor improvements (Davis 
et al. 2012). Fourthly, cut-points were estimated from receiver 
operating characteristic (ROC) curves using the 80% specific- 
ity rule (Aletaha et al. 2009, Kvamme et al. 2010). Fifth, cut- 
points were estimated from ROC curves using the cut-point 
corresponding to the smallest residual sum of sensitivity and 
specificity (Froud 2002, Maksymowych et al. 2007, Heiberg 
et al. 2008, Dougados et al. 2012). Sixth, cut-points were esti- 
mated by ROC curves using the cut-point corresponding to 
a 45 -degree tangent line intersection (equivalent to the point 
at which the sensitivity and specificity are closest together) 
(Froud 2002). 

An identical procedure was applied using the general-health 
anchor questions and EQ-5D. Percentile-based 95% confi- 
dence intervals (CIs) for cut-points were estimated by non- 
parametric bootstrap (Fiellin and Feinstein 1998, Campbell 
and Torgerson 1999) using 2,000 replications (Tables 2 and 
3), because of some small groups (n < 30) in the tertile estima- 
tions. The area under the curve, AUC, (with CI) was calcu- 
lated for the 3 methods using ROC curves. A correlation of 
at least 0.3 between the PRO scores and the anchor questions 
was considered adequate for applying the ROC methods (King 
2011). Patients answering "A little better" were considered 
to be reporting a minimal clinically important improvement, 
as this is the standard methodology (King 2011), but where 
the AUC estimation of the ROC curves (Fawcett 2006) was 
below the proposed minimum of 0.523 (Kvamme et al. 2010), 
patients answering "A little better" or "Much better" were 
pooled into 1 group. STATA software Version 11.0 was used 
for all statistical analyses. 

Ettiics 

The study was approved by the Danish National Board of 
Health and the Danish Data Protection Agency (journal 
number 2008-41-2593 and 2012-41-1368), whereas accep- 
tance by the Science Ethics Committee of the Region of 
Southern Denmark was not required. All the patients gave 
their informed written consent to participate in the study. 
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1 ,837 patients received study invitation 



502 patients deciined study participation 
(27%) 



1,335 patients accepted study participation and 
answered the preoperative questionnaire set 
(73%) 



8 patients died before compietion of thie 
1 -year postoperative questionnaire set 

39 patients did not answere the 1 -year 
postoperative questionnaire set 



1 ,288 patients answered the 1 -year 
postoperative questionnaire set 



49 patients with acute fractures were 
exciuded from the study 



1 ,239 patients included 



Figure 1 . Patient flow chart. 



Results 

Description of the study population 

Responders and non-responders. 1,335 of 1,837 patients 
(73%) agreed to participate in the study (Figure 1). Patients 
who declined study participation were older than those who 
were included (median age 70 years vs. 68 years (p < 0.001)), 
and they were less likely to have the diagnosis idiopathic 
osteoarthritis (84% vs. 90%, p = 0.001) (Table 1). 1,288 of 
1,335 patients (96%) answered the postoperative question- 
naire (Figure 1). 

Missing items. Missing items occurred in 13 to 140 items 
(1-11%) of the different items in the questionnaires. Preop- 
erative total scores could be calculated as follows: 98% for 
HOOS Pain, 98% for HOOS-PS, 99% for HOOS QoL, 95% 
for EQ-5D Index, and 98% for EQ-VAS according to rules 
for missing items. Postoperatively, total scores could be cal- 
culated as follows: 96% for HOOS Pain, 97% for HOOS-PS, 
97% for HOOS QoL, 95% for EQ-5D Index, and 96% for 
EQ-VAS. 

PRO scores. Distributions of preoperative and postoperative 
PRO scores are given in Figure 2 (see Supplementary data). 
Mean (CI) for the PRO scores pre- and postoperatively and 
change scores are given in Table 5 (see Supplementary data). 

Proportions reporting dijferent response options for the 
anchor questions. When responding to the MCII anchor ques- 
tion "Overall, how are the problems now in the hip on which 
you had surgery, compared to before your operation?", most 



Table 1 . Characteristics of the 1 ,837 patients who received an invi- 
tation to participate in the study in the period from IVlarch 1 , 201 0 
to IVlarch 1 , 201 1 



Pntipnt pharar'tpriQtipQ 

FCllldll Lil ICll OLild IOLIVjO 


PfltipnfQ whn 
n ciLid 1 lo vvi 


PntipntQ whn 


n-\/pli IP 

yj VcllUC 




ctljl ecu 








to participate 


participation 




Population (n (%)) 


1 ,335 (73) 


502 (27) 




Age ^ years, median (range) 


68 (23-94) 


70 (24-96) 


< 0.001 


18-50, n 


71 


27 


1.0 


50-70, n {%) 


706 (53) 


223 (44) 


0.001 


> 70, n {%) 


558 (42) 


252 (50) 


0.001 


Female sex, n (%) 


720 (54) 


295 (59) 


0.06 


Diagnosis, n (%) 








Idiopathic OA 


1,175 (90) 


397 (84) 


0.001 


Other arthritis 


45 


13 


0.5 


Childhood hip diseases 


30 


21 


0.02 


Sequel from fracture 


6 


5 


0.2 


Necrosis of femoral head 


7 


3 


0.8 



^ Age of patients on date of inclusion start. 

^ Idiopathic OA, other arthritis (Mb. Bechterew, other arthritis), 
childhood hip diseases (congenital hip dislocation, Mb. Calve-Legg- 
Perthes, epiphysiolysis, acetabular dysplasia), sequel from fracture 
(late sequel from fracture of proximal femur), and necrosis of femo- 
ral head (atraumatic necrosis of femoral head). Patients with frac- 
ture of acetabulum (29 patients accepted, 24 patients declined), 
traumatic hip dislocation (5 patients accepted, 6 patients declined), 
and fresh fracture of proximal femur (15 patients accepted, 4 
patients declined) were excluded from the study. 

patients (88%) considered themselves much improved and 77 
patients (6%) considered themselves a little improved 1 year 
after THA (Table 6, see Supplementary data). Regarding the 
anchor question "In general, would you say your health is. . .", 
34% reported 1 step better general health postoperatively as 
compared to preoperatively and 1 1 % reported 2 or more steps 
better general health (Table 7, see Supplementary data). 

Regarding PASS, postoperatively 92% of the patients 
described the results of their operation as at least good: 
"Excellent" (53%), "Very good" (27%), or "Good" (12%). A 
total of 82% of the patients described their general health as at 
least good: "Excellent" (12%), "Very good" (35%), or "Good" 
(35%) at 1 year postoperatively in contrast to 68% before sur- 
gery ("Excellent" (44 patients, 3%), "Very good" (24%) or 
"Good" (41%)). 338 patients (27%) reported scores for all 3 
HOOS subscales at or above the "Much better" cut-point, and 
881 patients (71%) reported scores for all 3 HOOS subscales 
at or above the "A little better" cut-point. PRO change scores, 
mean preoperative scores, and mean postoperative scores for 
different anchor answer categories are given in Figures 3 and 
4 (see Supplementary data). 

Correlation of PROs and anchor questions 

The correlations between the hip- specific anchor used for 
MCII estimation and the PRO change scores were higher than 
0.30 (p < 0.001) for all PRO subscales, except for the EQ-5D 
Index (-0.27; p < 0.001) and EQ-VAS (-0.25; p < 0.001). 
The correlations between the general-health anchor used for 
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Table 2. Minimal clinically important improvement (MCII) for HOOS and EQ-5D. Values are cut-point (95% CI) or AUG 



43 



IVICII 

75tli percentile approacii ^ Receiver operating cliaracteristic (ROC) approach 

Mean 

cliange All patients Lowest Middle Highest 80% speci- Maximal 45° tangent 
PRO approach ^ tertile " tortile " tortile " ficity "= accuracy " line ^ AUC ' 



Estimated from the hip-specific anchor question: "Overall, how are the problems now in the hip on which you had surgery, compared to before 



your operation?" 




















HOOS Pain 


24 


33 


49 


31 


23 


28 


33 


33 


0.91 


95% CI 


(20-28) 


(29-43) 


(34-55) 


(25-40) 


(17-30) 


(18-33) 


(29-40) 


(33-39) 


(0.87-0.94) 


HOOS-PS 


23 


35 


43 


36 


25 


23 


35 


34 


0.89) 


95% CI 


(19-28) 


(25-40) 


(32-54) 


(25-45) 


(20-32) 


(15-35) 


(27-50) 


(31-37) 


(0.85-0.94) 


HOOS QoL 


17 


25 


31 


25 


11 


19 


38 


27 


0.93 


95% CI 


(12-22) 


(19-30) 


(25-45) 


(19-28) 


(6-19) 


(13-31) 


(32-39) 


(28-39) 


(0.91-0.96) 


EQ-5D Index 9 


0.14 


0.27 


0.40 


0.10 


0.01 


0.33 


0.16 


0.18 


0.77 


95% CI 


(0.10-0.18) 


(0.13-0.40) 


(0.40-0.48) 


(0.07-0.11) 


(0-0.05) 


(0.13-0.33) 


(0.12-0.23) 


(0.14-0.24) 


(0.70-0.84) 


EQ-VAS 9 


7 


20 


22 


20 


5 


15 


23 


11 


0.76 


95% CI 


(1-12) 


(15-21) 


(20-50) 


(10-25) 


(-13-15) 


(10-20) 


(3-28) 


(12-16) 


(0.70-0.82) 


Estimated based on the change from preoperatively to postoperatively and the general-health anchor question: "In general, would you say your 


health is..." 




















EQ-5D Index 9 


0.31 


0.40 


0.67 


0.34 


0.23 


0.41 


0.18 


0.27 


0.58 


95% CI 


(0.29-0.34) 


(0.35-0.45) 


(0.67-0.68) 


(0.34-0.34) 


(0.22-0.23) 


(0.37-0.44) 


(0.07-0.34) 


(0.28-0.29) 


(0.55-0.62) 


EQ-VAS 


23 


35 


51 


30 


15 


35 


12 


18 


0.60 


95% CI 


(21-25) 


(32-40) 


(50-55) 


(27-30) 


(14-15) 


(30-40) 


(7-14) 


(16-20) 


(0.57-0.63) 



^ All patients answering "a little better" to the hip-improvement anchor or all patients answering to the general-health anchor 1 category better 
postoperatively than preoperatively. 

^ Tertiles (lowest third, middle third, highest third) of preoperative PRO scores. 
Cut-point corresponding to the 80% specificity rule; all patients answering "a little better" and "much better" to the hip-improvement anchor 
or all patients answering to the general-health anchor 1 category better postoperatively than preoperatively. 

Cut-point corresponding to the smallest residual sum of sensitivity and specificity; all patients answering "a little better" to the hip-improve- 
ment anchor or all patients answering to the general-health anchor 1 category better postoperatively than preoperatively. 

^ Cut-point corresponding to a 45-degree tangent line intersection, equivalent to the point at which the sensitivity and specificity are clos- 
est together; all patients answering "a little better" to the hip-improvement anchor or all patients answering to the general-health anchor 1 
category better postoperatively than preoperatively. 

* Area under the curve; all patients answering "a little better" and "much better" to the hip-improvement anchor, from the 80% specificity 
approach, or all patients answering to the general-health anchor 1 category better postoperatively than preoperatively, from the 80% speci- 
ficity approach. 

9 Anchor-PRO correlation < 0.30. 



MCII estimation and the PRO change scores were higher than 
0.30 for EQ-VAS (0.35; p < 0.001), but that for EQ-5D Index 
was 0.27 (p < 0.001). The correlations between both the hip- 
specific anchor and the general-health anchor used for PASS 
estimation and the postoperative PRO scores were higher than 
0.50 for all PRO subscales (p < 0.001), except for the cor- 
relation between the hip-specific anchor and EQ-VAS (-0.48; 
p < 0.001) (Table 8, see Supplementary data). 

MCII cut-points for PRO change scores 

MCII cut-points for HOOS based on the mean score change 
method and the hip-specific anchor question "Overall, how 
are the problems in your hip now compared to preopera- 
tively?" were 24 (CI: 21-29) for HOOS Pain, 23 (CI: 19-30) 
for HOOS-PS, and 17 (CI: 13-21) for HOOS QoL (Table 2). 
Thus, an improvement of 24 points in the HOOS Pain subscale 
corresponds to the minimal improvement in pain considered 
clinically important by THA patients. Estimates based on 
the 5 other calculation methods (the seventy-fifth percentile 
approach (all patients and by tertiles), the 80% specificity rule. 



the 45-degree tangent Une intersection cut-point, and the cut- 
point corresponding to the smallest residual sum of sensitivity 
and specificity) are presented in Table 2. The estimated MCII 
cut-points for EQ-5D Index and EQ-VAS based on a general- 
health anchor were 0.31 (CI: 0.29-0.33) and 23 (CI: 21-25), 
respectively (Table 2). The MCII was dependent on baseline 
score, as lower tertiles corresponded to higher MCII for all 
PROS (Table 2). 

PASS cut-points for postoperative PRO scores 

PASS cut-points for the HOOS subscales when answering 
"Excellent", "Very good", or "Good" to the question "How 
would you describe the results of your operation?" were 91 
(CI: 91-92) for HOOS Pain, 88 (CI: 87-88) for HOOS-PS, 
and 83 (CI: 82-84) for HOOS QoL (Table 3). 

Thus, a postoperative score of 9 1 in the HOOS Pain subscale 
corresponds to a symptom state considered acceptable 1 year 
after THA. The cut-points representing PASS when reporting 
the general health postoperatively were 0.92 (CI: 0.91-0.92) 
for EQ-5D Index and 85 (CI: 84-86) for EQ-VAS (Table 3). 
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Table 3. Patient-acceptable symptom state (PASS) for HOOS and EQ-5D. Values are cut-point (95% CI) or AUG 



PASS 

75tli percentile approacii ^ Receiver operating cliaracteristic (ROC) approach 
iVIean 



score Aii patients Lowest iVliddie Higiiest 80% speci- iVlaximai 45° tangent 
PRO approacli ^ tertile " tertiie " tertile " ficity "= accuracy " line ^ AUC ' 



Estimated from the hip-specific anchor question: "How would you describe the results of your operation?" 



HOOS Pain 


91 


100 


100 


100 


100 


75 


75 


81 


0.90 


95% CI 


(91-92) 


(100-100) 


(100-100) 


(100-100) 


(100-100) 


(68-83) 


(71-84) 


(76-84) 


(0.86-0.94) 


HOOS-PS 


88 


100 


100 


100 


100 


75 


65 


75 


0.90 


95% CI 


(87-89) 


(100-100) 


(95-100) 


(100-100) 


(100-100) 


(65-80) 


(61-81) 


(71-76) 


0.87-0.94) 


HOOS QoL 


83 


100 


100 


100 


100 


50 


58 


56 


0.94 


95% CI 


(82-85) 


(100-100) 


(94-100) 


(100-100) 


(100-100) 


(44-56) 


(51-59) 


(51-64) 


(0.92-0.97) 


EQ-5D-lndex 


0.90 


1 


1 


1 


1 


0.72 


0.77 


0.77 


0.92 


95% CI 


(0.89-0.91) 


(1-1) 


(1-1) 


(1-1) 


(1-1) 


(0.72-0.77) 


(0.73-0.80) 


(0.73-0.79) 


(0.90-0.94) 


EQ-VAS 


82 


95 


90 


90 


95 


70 


70 


70 


0.86 


95% CI 


(81-83) 


(90-95) 


(90-90) 


(90-95) 


(95-98) 


(70-80) 


(71-76) 


(71-71) 


(0.82-0.89) 


Estimated based on the postoperative answers to the general-health anchor question: 


"In general, would you say your health is. . . 




EQ-5D Index 


0.92 


1 


1 


1 


1 


0.82 


0.79 


0.82 


0.84 


95% CI 


(0.91-0.92) 


(1-1) 


(1-1) 


(1-1) 


(1-1) 


(0.78-0.84) 


(0.79-0.83) 


(0.79-0.83) 


(0.81-0.88) 


EQ-VAS 


85 


95 


90 


94 


95 


75 


72 


75 


0.89 


95% CI 


(84-86) 


(94-95) 


(90-95) 


(90-95) 


(95-98) 


(70-80) 


(71-76) 


(73-80) 


(0.86-0.91) 



^ All patients with an "excellent", "very good" or "good" postoperative operation result or general health. 
Tertiles (lowest third, middle third, highest third) of preoperative PRO scores. 

Cut-point corresponding to the 80% specificity rule; all patients with an "excellent", "very good", or "good" operation result or general health 
postoperatively. 

Cut-point corresponding to the smallest residual sum of sensitivity and specificity; all patients with an "excellent", "very good", or "good" 

operation result or general health postoperatively. 
^ Cut-point corresponding to a 45-clegree tangent line intersection, equivalent to the point at which the sensitivity and specificity are closest 

together; all patients with an "excellent", "very good", or "good" operation result or general health postoperatively. 
* Area under the curve; all patients with an "excellent", "very good", or "good" operation result or general health postoperatively. 



The PASS was independent of baseline score for all PROs, 
since identical PASSs were achieved for the different tertiles 
of baseline scores. 

Subgroup estimations. The gender-dependent differences 
in favor of males in mean postoperative scores ranged from 
4-5 points on a 0-100 scale for the HOOS subscales and the 
EQ-VAS (p < 0.001), with no significant differences in PRO 
change scores between males and females (p a 0.3). Patients 
with idiopathic OA had significantly better postoperative 
PRO scores than patients with other diagnoses (4-1 1 points 
on a 0-100 scale for the HOOS subscales and the EQ-VAS; 
p s 0.03), with no significant differences in PRO change 
scores (p a 0.06). Patients over 70 years had the worst mean 
PRO change scores for HOOS Pain, HOOS-PS, and EQ-VAS 
(3-11 points less than other age groups; p < 0.003). For 
HOOS-PS and EQ-VAS, the oldest also had the worst post- 
operative mean scores (4-5 points less; p s 0.002). Males had 
higher PASS estimates than females (p s 0.04), OA patients 
had higher PASS estimates for HOOS QoL and EQ-5D Index 
than other patients (p < 0.008), and patients over 70 years had 
lower PASS estimates than younger patients for HOOS Pain, 
HOOS-PS, and EQ-VAS (p < 0.03) (Table 4). 



Discussion 

We aimed to establish THA patients' viewpoints on clini- 
cally important improvements and acceptable symptom state 
cut-points based on changes in PRO scores and postoperative 
PRO scores. A better understanding of these cut-points would 
contribute to our interpretation of change in PRO scores and 
postoperative PRO scores following orthopedic procedures. 
Our results, showing that MCII varied with baseline score 
and that PASS varied with gender, diagnosis (for HOOS QoL 
and EQ-5D Index), and age (for HOOS Pain, HOOS-PS, and 
EQ-VAS), support current understanding that there is a range 
of values influenced by factors such as patient group, interven- 
tion, time to follow-up, and methodological issues including 
cut-point chosen for the anchor question — and also choice of 
anchor-based methods or distribution-based methods (King 
2011). 

MCII 

In earlier reports, MCII for EQ-5D has been found to vary 
considerably (0-0.69) in patients with rheumatoid arthritis, 
psoriatic arthritis, and ankylosing spondylitis after 3 months 
of treatment with disease-modifying anti-rheumatic drugs, 
indicating that MCII is dependent on patient group (Kvamme 
et al. 2010). In addition, the mean MCII for the EQ-5D Index 
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Table 4. Patient-acceptable symptom state (PASS) for different subgroups; mean (95% CI) 



Subgroups 

Males Females p-value ^ Idiopatic Other p-value ^ < 50 years 50-70 years > 70 years p-value 
PRO OA diagnoses 



Estimated by the mean score method using the anchor question: "How would you describe the results of your operation?" 



HOOS Pain 


93 


90 


<0.001 


92 


90 


0.3 


93 


92 


90 


0.03 


95% CI 


(92-94) 


(89-91) 




(91-92) 


(87-93) 




(91-95) 


(91-93) 


(89-92) 




HOOS-PS 


90 


86 


<0.001 


88 


86 


0.3 


91 


89 


85 


<0.001 


95% CI 


(88-91) 


(85-88) 




(87-89) 


(83-90) 




(89-94) 


(88-90) 


(84-87) 




HOOS OoL 


85 


82 


0.005 


84 


77 


0.008 


80 


83 


84 


0.4 


95% CI 


(84-87) 


(80-83) 




(83-85) 


(72-82) 




(75-85) 


(82-85) 


(82-86) 




EQ-5D Index 


0.91 


0.89 


0.002 


0.90 


0.85 


0.004 


0.90 


0.91 


0.89 


0.03 


95% CI 


(0.90-0.92) 


(0.88-0.90) 




(0.90-0.91) 


(0.82-0.89) 




(0.87-0.93) 


(0.90-0.92) 


(0.87-0.90) 




EQ-VAS 


85 


80 


<0.001 


82 


80 


0.3 


86 


84 


79 


<0.001 


95% CI 


(83-86) 


(78-81) 




(81-83) 


(77-84) 




(82-89) 


(83-85) 


(77-81) 




Estimated by the mean score method using the anchor question: "In general, would you say your health 


is..." 






EQ-5D Index 


0.93 


0.91 


0.04 


0.92 


0.87 


0.003 


0.91 


0.92 


0.92 


0.9 


95% CI 


(0.92-0.94) 


(0.90-0.92) 




(0.91-0.93) 


(0.84-0.90) 




(0.88-0.94) 


(0.91-0.93) 


(0.90-0.93) 




EQ-VAS 


87 


83 


<0.001 


85 


83 


0.2 


88 


86 


83 


0.01 


95% CI 


(86-88) 


(82-85) 




(84-86) 


(80-86) 




(85-90) 


(85-87) 


(82-85) 





« Welch's t-test. 
" W-test. 



was on average 0.074 (range -0.01 1 to 0.140) in 8 longitudinal 
studies with 1 1 patient groups (hip OA or THA patients not 
included) (Walters and Brazier 2005). This confirms that there 
are no universal cut-points for a single PRO, as the estimates 
will vary by population and context (King 201 1), which is sup- 
ported by the present study. An MCll of 15 of 100 for absolute 
improvement has been recommended for 4 different generic 
PROs in chronic rheumatic diseases (Tubach et al. 2012). The 
smaller MCll recommended by Tubach et al. (compared to 
our study) can be explained by the different patient popula- 
tions (chronic rheumatic diseases vs. THA patients), the dif- 
ferent interventions (non-steroidal anti-inflammatory drugs 
vs. THA), and the much longer follow-up time of 1 year in our 
study as compared to 4 weeks. Our finding of an MCll of 0.3 1 
for the EQ-5D Index corresponds well with previous findings 
of 0.32 (anchor-based methods, identical anchors, and esti- 
mation approach) and 0.42 (distribution-based methods) for 
THA patients 6 months after surgery (Browne et al. 2007). We 
found that the MCII was dependent on baseline score, and it is 
likely that the poorer patient scores (for instance, for QoL or 
general health) prior to the THA, the more they may gain from 
the operation (Figure 4, see Supplementary data) (Tubach et 
al. 2006, Quintana et al. 2012). 

Many similar terms and definitions related to the minimal 
important difference have been used (King 2011). We have 
used MCll, which is similar to the minimal clinically impor- 
tant difference, except that MCll only addresses the direction 
of improvement and not worsening (Tubach et al. 2009). We 
calculated MCII estimates by several approaches and the mean 
change approach was considered the primary approach, as this 
is robust and may seem more intuitive and more easy to inter- 
pret than the complex ROC approaches (de Vet et al. 2007). 



We used a hip-specific anchor for MCII for the hip-specific 
PROs as recommended (Revicki et al. 2008, Shi et al. 2010), 
because there should be a theoretical basis for the relationship 
between the anchor and the relevant domain (King 201 1). For 
the same reason, the general-health anchor was regarded to 
be the most important for the EQ-5D MCII estimation. The 
EQ-VAS had an adequate correlation to the general-health 
anchor. The EQ-5D Index time trade-off valuation methodol- 
ogy has recently been criticized for lacking validity (Augestad 
and Rand-Hendriksen 2012), which may explain our find- 
ing of the EQ-5D Index's suboptimal correlation to both the 
general-health anchor and the hip-specific anchor. By using 
different anchor questions, different concepts are examined; 
a smaller change in PRO score is important for the patient 
when reporting hip improvement, compared to a substantially 
Icirger change in PRO score that is required to make an impact 
on the minimal important improvement in the patient's gen- 
eral health. When estimating MCII after THA, hip-specific or 
general-health anchors should be chosen based on whether the 
PRO is a hip-specific or a general-health outcome measure, 
but also depending on whether the focus is determining hip 
improvement after THA or determirung the impact of THA on 
general health. 

Due to the overall good results from THA, an important 
question is whether "A little better" is a clinically relevant out- 
come 1 year after THA; considering the costs and risks of the 
procedure, patients could be expected to be "Much better". A 
high proportion of the patients indeed got "Much better", but a 
lesser improvement was acknowledged to be clinically impor- 
tant also, because an MCII cut-point estimation based solely 
on "Much better" would take 73% of the patients below the 
MCII cut-point in at least 1 HOOS subscale, which is clearly 
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too many and would reduce the clinical value of this cut-point. 
MCIIs were estimated for patients who reported being "A 
little better", but also for patients answering "A little better" 
or "Much better" combined, where the AUC estimations of 
the ROC curves were below the proposed minimum (Kvamme 
et al. 2010), and the results were similar (Table 2). In Figures 
3 and 4 (see Supplementary data), HOOS Pain and EQ-5D 
Index are shown as examples to illustrate mean scores for the 
different answer categories in comparison to a hip-specific 
anchor and a general-health anchor, respectively. 

PASS 

To our knowledge, there have been no previous PASS estima- 
tions for HOOS and EQ-5D using THA patients. PASS for 
rheumatoid arthritis patients has been estimated to be around 
0.70 in EQ-5D Index in patients after 3 months of treatment 
with disease-modifying anti-rheumatic drugs (Kvamme et 
al. 2010). PASS has been reported to be 40 of 100 for abso- 
lute improvement, in different generic PROs in patients with 
chronic rheumatic diseases (Tubach et al. 2012). The smaller 
PASSs found in these studies may be explained by the dif- 
ferent interventions and the different patient populations. 
Patients' expectations for an acceptable symptom state may 
be higher in THA patients due to the intervention itself, and 
chronic rheumatic patients may have a lower threshold for an 
acceptable symptom state due to the nature of their disease. 

PASS was found to be independent of baseline scores, 
which is supported by a previous study (Escobar et al. 2012). 
It has been noted that the methodology for identification of 
PASS may influence the identified cut-points, and that the 
ROC approach generally provides estimates that are some- 
what lower than the cut-points identified with the seventy-fifth 
percentile approach (Kvien et al. 2007), similar to what was 
seen in our data. For THA patients, the operation result and 
the postoperative general health seem to be equally depen- 
dent on the intervention. The THA has to eliminate most of 
the patient's symptoms for patients to achieve an acceptable 
symptom state, since PASS is quite close to the best possible 
score for the PROs studied. Others have found a greater change 
from preoperative to postoperative evaluation in both EQ-5D 
Index and EQ-VAS than in our study (Browne et al. 2007, 
Chard et al. 201 1, Rolfson et al. 201 1). This can be explained 
by our patients having better preoperative (and postoperative) 
scores, and therefore less potential for improvement (Table 5, 
see Supplementary data). 

Limitations and strengths. Several methodological limita- 
tions should be taken into consideration when interpreting 
the results of the present study. Patients who declined to par- 
ticipate in the study were on average 2 years older and more 
seldom operated due to idiopathic OA than the patients who 
were included. There was a higher percentage of idiopathic 
OA patients in our study population than in the Danish THA 
population, which may have reduced the external validity of 
the study, but the mean age and proportion of women in our 



study are very similar to figures for the total Danish THA pop- 
ulation. The feasibility of the PROs included has previously 
been assessed in a THA population (Paulsen et al. 2012b), but 
the additional questions included were not psychometrically 
validated as a part of this study. 2 of the anchor questions were 
translated from English without a formal cross-cultural valida- 
tion. As previously reported by others (Jansson and Granath 
2010), EQ-5D Index had a bi-modal distribution in our data, 
introducing more uncertainties in our results than described 
by the confidence intervals and p-values. EQ-5D Index had 
anchor correlations of less than 0.30 for both the hip-specific 
change anchor and the general-health change anchor. This is a 
limitation of the EQ-5D Index MCII estimation. The subopti- 
mal correlation between EQ-5D Index and the general-health 
anchor may well be an artifact from the time trade-off valu- 
ation process, as this process not only includes the patients' 
descriptions of their own health, but also how this health state 
is perceived by the general population. For all other MCII 
estimations, the anchor-PRO correlations were moderate, but 
below 0.50. MCII estimations were based on a retrospective 
transition anchor for HOOS and an absolute change anchor 
(postoperative score - baseline score) (Hays et al. 2005, 
Tubach et al. 2005) for EQ-5D, and the use of different anchor 
types for different PROs could be viewed as a limitation. 

Despite the minor differences in patient characteristics, we 
consider the results to have high external validity due to the 
inclusion of approximately 15% of the entire Danish THA 
production of 2010 from 16 centers dispersed all over Den- 
mark, centers with both low and high production, public as 
well as private, and both university hospitals and community 
hospitals. MCII and PASS estimations were based on hip-spe- 
cific anchors for hip-specific HOOS subscales and on general- 
health anchors for the general-health focused EQ-5D Index 
and EQ-VAS. Several estimation approaches were used, and 
all estimations are reported (Tables 2 and 3). In all PASS esti- 
mations, the anchor-PRO correlations were over 0.50, except 
for the EQ-VAS (-0.48). 

Conclusion 

Using a population-based cohort design, we determined 
cut-points for the change representing the MCII and for the 
postoperative score representing the PASS 1 year after THA 
for HOOS Pain, HOOS-PS, HOOS QoL, EQ-5D Index, and 
EQ-VAS. This study facilitates interpretability of PRO scores 
and may improve understanding of PRO findings in future 
THA outcome studies. MCIIs corresponded to a 38-55% 
improvement from mean baseline PRO score and PASSs cor- 
responded to absolute follow-up scores of 57-91% of the 
maximum score in THA patients 1 year after surgery, which 
may serve as reference values in registry settings. 

Supplementary data 

Tables 5-8 and Figures 2-4 are available at Acta's website 
(www.actaorthop.org), identification number 6152. 
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